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THE  MOMENTS  OF  THE  SAMPLE  MEDIAN1’ 2 


by 

John  T.  Chu  and  Harold  Hotelling 
Institute  of  Statistics.  University  of  North  Carolina 

1.  Summary.  It  is  shown  that  under  certain  regularity  conditions,  the  moments 
about  its  mean  of  the  sample  median  tend,  as  the  sample  size  increases  indefinitely, 
to  the  corresponding  ones  of  the  asymptotic  distribution  (which  is  normal).  A 
method  of  approximation,  using  the  inverse  function  of  the  cumulative  distribution 
function,  is  obtained  for  the  moments  of  the  sample  median  of  a certain  type  of 
parent  distribution.  An  advantage  of  this  method  is  that  the  error  can  be  made  as 
small  as  is  required.  Applications  to  normal,  Laplace,  and  Cauchy  distributions  are 
discussed.  Upper  and  lower  bounds  are  obtained,  by  a different  method,  for  the 
variance  of  the  sample  median  of  normal  and  Laplace  parent  distributions.  They  are 
simple  in  form,  and  of  practical  use  if  the  sample  size  is  not  too  small, 

2.  Introduction.  Let  a population  be  given  with  cdf  (cumulative  distribution 

function)  F(x)  and  pdf  (probability  density  function)  f(x),  and  median  E which  we 

assume  to  exist  uniquely.  Let  x denote  the  sample  median  of  a sample  of  size 

2n  ♦ 1.  Then  the  pdf  g(x)  of  x and  the  pdf  h(x)  of  the  asymptotic  distribution  of 
«%* 

x are  respectively 

(1'  g(x)  = C^f  F(x)_7n/“1  - F(x)_7nf(x), 

where  = (2n  + l)j/(nj  nl),  and 

~n  This  paper  was  present  ad  at  the  summer  meeting  of  the  Institute  of 
Mathematical  Statistics  at  Montreal,  P.Q. , Canada,  on  September  11,  195U. 

2.  Sponsored  by  the  Office  of  Naval  Research  under  Contract  NR  0^2  031  for 
research  in  Probability  and  Statistics. 
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(2) 


h(x)  = {?n)~±/d  e 


-1/2  -(x“C)?/(2t-,?) 


where  u?  - {U/"f(£)_7":(2n  + 1)  j “1. 


A question  that  follows  naturally  is:  Can  the  moments  of  the  asymptotic  dis- 

<*>  ** 
tribution  of  x be  used  as  approximations  to  the  corresponding  moments  of  x,  and  if 

not,  how  to  find  better  anproximations?  When  the  parent  distribution  is  normal, 

this  question  has  been  answered  by  various  authors,  c.g.,  T.  Hojo  £"G  ~J > K.  Pearson 

r 8,  9_7  and  more  recently,  J.  H.  Cadwell  /~3  7*  It  has  been  stated,  e.g.,  in  /"" 3_7; 

J%r 

that  experiments  showed  that  the  distribution  of  x tends  rapidly  to  normality,  but 
the  variance  of  x (as  of  quantiles  in  general)  tends  only  slowly  to  the  variance  of 
the  asymptotic  distribution.  For  this  reason  of  slow  convergence,  approximations 
were  derived  for  the  variance  of  x when  the  sample  rize  is  small . Wrile  different 
methods  were  used  by  different  authors,  their  results  agree  fairly  well  with  each 
other.  In  pact,  the  problem  should  be  considered  as  completely  solved  but  for  the 
unknown  error  committed  in  using  such  approximation. 

One  of  us  f~h,  7 recently  proved  that  tho  distribution  of  x,  for  a normal 
parent  distribution,  does  tend  to  normality  rather  "rapidly".  In  \ 6 we  shall 
confirm  another  experimental  result  that  the  variance  of  x tends  ''slowly"  to  the 
variance  of  the  asymptotic  distribution  (actually  not  "very  slowly").  Upper  and 
lower  bounds  are  obtained  ( $ 6,  Theorem  U)  for  the  variance  of  x.  A slightly  better 
lower  bound  is  obtained,  by  a different  method,  in  $ 5*  formula  (I;9),  or  j 6, 
formula  (7U).  It  seems  that  even  for  sample  sizes  around  10  to  20,  the  asymptotic 
variar.ee  is  not  a bad  approximation  to  the  true  variance  of  x.  It  becomes  a very 
good  approximation  if  the  sample  size  is  large.  However,  for  large  samples,  an  even 
better  approximation  is  obtained  in  $ 5>  formula  (56). 

Before  further  discussion,  the  following  notations  will  he  introduced.  If 
f(x)  and  g(x)  are  functions  of  x,  then  E^(g)  denotes  the  expectation  of  e(x)  with 
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respect  to  f(x),  i,e.,  j g(x)  f(x)  dx.  ’Ve  use,  where  f,  g,  and  h arc  given  by 


(1)  and  (2), 


V-X  “ Er/x) 


H-,  - E,.(x)  = £ , 
JL  ll 


° E (x  - y^)  , 


and  for  any  integer  k > 2, 


- E-(x) 


Hi  “ 


“ Hi-)  , 


_ - sk 

-'u\A  " H'l  > > 


^ - Vx  - ■ 


“k  * Vx  ‘ '^l* 


It  should  be  pointed  out  that,  although  the  pdf  g(x)  of  x tends  to  h(x},  the  moments 
y_k  of  x in  general  do  not  necessarily  tend  to  uk<  In  fact  y-k  may  never  exist 

Nevertheless,  if  the  parent  pdf  satisfies  certain  conditions,  then  it  can  be  shorn 

A*  — 

that  p_k  tends  to  y.k  as  sample  size  tends  to  infinity  ( j3.  Theorem  l).  Therefore 
under  such  circumstance,  it  is  justifiable,  at  least  for  large  samples,  to  use  y,k 
as  an  approximation  to  y.k» 

If  the  parent  distribution  satisfies  certain  conditions,  a general  method  is 
obtained  in  \ h for  computing  y.^.*  k = 1,  ?,  ...  . The  method  is  based  on  the  Taylor 

expansion  of  x(F),  the  inverse  function  of  F(x).  For  example,  if  x(F)  - £ 

oo  *i  in  -Tti  k > 

= Z a (V  - ■*)  converges  for  0 < F < 1,  a =*  0(2  m ) where  k 0 and  f(x)  5s  sym- 

m ■ 2 n < 

m=l 

metric  with  respect  to  x = £,  then  when  n > 2k  + 3, 


■at 

ri  * 


u 


(5) 


l 

S2  C FU(1  - F)n  dF  , 
m n ' ’ 

0 


m . 

where  r is  given  by  (l)  and  S = Z a^(F  - i)  ( $ 4,  Theorem  3).  Error  ?::■  such 

m r=l 

approximation  can  be  computed,  and  it  tends  to  0 as  m tends  tc  oo.  If  the  parent 
pdf  is  not  symmetric,  similar  approximations  can  be  obtained  ( ) U,  formula  26). 
Applications  are  given  to  the  variances  of  the  sample  medians  of  Laplace  and  Cauchy 
parent  distributions  { $ U,  Examples  1 and  2). 

Finally  upper  and  lower  bounds  are  derived,  in  7 for  the  variance  of  x of  a 
Laplace  parent  distribution.  It  then  can  be  seen  that  for  estimating  the  mean  of  a 
Laplace  distribution,  the  sample  median  is  a "better"  estimate  than  the  sample  mean, 
net  only  for  large  samples,  but  for  small  samples  as  well. 


3.  Large  Sample  Moments. 


Lemma  1.  If  0 < c < — , then  for  m,  n = 1,  2,  ..., 


(6) 


I- 

( 


Uc‘ 


-i  m 

| U - ■*  j un(l  - u)  du 


(|) 


m+2n+I 


t(n.-l)/2  (1  . t)n  dt 


- - C 


0 


In  particular,  if  c 


1 

2 


and  Cn  = (2n  + l).'/nl  nj 


we  have  for  fixed  m. 


C | u - ~ 
n 1 2 


n,, 
u (1 


u)n  du  = 0(n 


(7) 


» 


} f ! 


, i i 


(8)  C (u 
1 n 


2m 


•5)  un(l  - u;ndu 


(i) 


2m 


1 . 3 ...  (2m  - 1) 


2 ( 2n  + 3 J ( 2n  + I 2n  + 2m  + l)  * 


-/  - 

a 


A 

1 


A 


II 


■If 


I f 


1 i 
; 

I 


■ 

~1 


(9)  Cn  |u 


1 2m-l 

i | un(l-u)ndu 


f-l^ra  1 • 3 * * * ( 2n+  2n-l ) (m  - l)l 

K~2  2’ U‘ ' -{Pn+Pm)  * T2n+3)(2n+5<1. . .(2n+2m-lJ 


These  formulae  are  easily  proved  using  transformations  v = + (u-rj),  etc. 

Theorem  1.  Let  a population  be  given  with  cdf  F(x)  and  pdf  f(x).  Suppose 
that  the  median  £ of  the  given  population  exists  uniquely  and  f(c)  y*  0,  and  f'(x) 
exists  and  is  bounded  in  some  neighborhood  of  x = £.  If  x is  tne  sample  median  of 
a sample  of  size  2n  + 1,  and  p^  and  JI.  . as  defined  by  (U),  are  respectively  the 

k ‘ moment  of  x about  its  mean  and  the  corresponding  one  of  its  ssymntotic  dis- 
tribution, then 


(10) 


(11) 


Lim  'x2k-l  ~ ^2k-l  , 

- *>-  r\r\  J 


n —*■  00 


Lim  ^ / P?k  = 1 , 

n — *»  00 


k - 1,  2,  . . . , 


provided  that  in  each  case,  p.^  1 an<^  are  ^or  l°ast  one  n» 

P:  oof.  Wo  will  prove  (ll)  as  an  illustration  of  the  method  we  use.  (10) 
can  be  shown  in  the  same  way.  Obviously 


2k -1 


<1?>  }*  - “k  * £ ( f)  d2k'3  ^ > 


where  |2kj  = ( 2k)  .*/  ( j l ( 2k  - j)  J ) . We  say  that  if  [Lov  is  fiuite  for  a certain 


2k 


i I 


l 


u 
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n = rip,  then  is  finite  for  all  n > n^,  and 


(13)  ^2m-l  = °(n_,n)  > 


a,  2m 

am  ^ - <-£> 


1 • 3 • • • (2m  - l) 


-m-l/2N 

+ <An  • ), 


for  m = 1,  2,  . . . , k, 


where  =*  l/f(|).  On  combining  (12),  (13),  and  (lh),  it  follows  that 


a.  ?k 

U5)  d?k  = (-y) 


1 • 3 *'*  (2k  - 1) 


+ 2k  + 


* 0(n-"-^)  . 


Since  p.~,  = 1 • 3 •••  (2k  - l)  jl^  , where  is  defined  by  (?),  we  have  (ll). 


To  complete  the  proof,  it  remains  to  establish  (ij)  and  (lU).  Now  for 


example. 


(x  - g)2™  Cn  /"F(x)_7n  jTl  - F(x)_7n  f(x)  dx 


a b 


= X1  + 12  + J3  * say, 


-oo  a b 


where  a < £ and  b > 5 will  be  chosen  later,  f'or  0 < F < 1,  the  function  F(l  - F)  is 

11  1 
ncr.-neeative . reaches  its  maximum  t at  ? “ ^ , and  is  increasing  for  0 < F < -^  and 
~ a c — — ^ 


decreasing  for  < F < 1.  Lot 


r = max  (UF(a)/"l  - F(a)_7,  i|F(b)/"l  - F(b)_7)  , 


then  0 < r < 1.  Since  = 0(2^nr?~'^) , it  follows  that 

(18)  xi  + t3  = 0(nl/2rn)  . 

On  the  other  hand,  if  a and  b are  so  chosen  that,  o.g.,  F(b)  - - F(a)  = c is 

small,  then  for  ■l-c<F<i+c,  x(F),  the  inverse  function  of  F(x),  is  uniquely 
defined  and  may  be  expanded,  by  Taylor's  method,  into 

(19)  x(F)  - 5 = a^(F  - l)  + H?(F  - |)2  , 

•!#lere  = l/f ) and  is  the  remainder.  Substituting  ( 19 ) for  x - £ in  of 

(16),  it  can  be  shown,  using  Lemma  1,  that  I?  is  equal  to  the  RHS  (right  hand  side) 
of  (1U).  Combining  this  fact  with  (l6)  and  (18),  we  obtain  (lU). 

Regarding  the  above  theorem,  we  make 

Remark  1,  A sufficient  condition  for  p^  being  finite  for  seme  n - n^  (hence 

all  n > n^)  is  that  p^  be  finite.  This  condition,  however,  is  not  necessary.  For 

example,  the  variance  of  the  sample  median  of  a Cauchy  parent  distribution  is  finite 
if  the  sample  size  2n  + 1 > 5>,  though  the  variance  of  the  parent  distribution  is 
infinite  ( ^ U,  Example  2). 

Remark  2.  Theorem  1 states  only  some  sufficient  conditions  under  which  (10) 
and  (11)  are  true.  For  a Laplace  parent  distribution,  f'(g)  does  not  exist,  yet 
(10)  and  (11)  hold  ( \ U,  Example  1). 

The  above  theorem  provides  a justification,  at  least  for  large  samples,  for 
using  p^  as  jn  approximation  to  p^.  In  the  next  section  we  will  proceed  to  show  that 

if  the  parent  pdf  satisfies  some  additional  conditions,  then  satisfactory  approxima- 
tions can  be  obtained  for  p,  for  samoles  of  smaller  sizes  as  well. 

_ k 
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ii.  Approximations . 

lemma  2.  If  k is  real,  then  the  following  series  is  convergent  f <-.?  every 
positive  integer  n > k, 


1 


( 20) 


oo 

2 

m-1 


m 


|2(F--|)|  C^d  - F)n  dF  . 


0 


Proof.  Use  Lemma  1 and  the  fact  jT  1,  p.  33  7 that  if  > 0 and  m(a^/am,,-l) 


co 


approaches  r > 1,  then  2 a is  convergent;  or  apply  the  Stirling's  approximation. 


m-1 


m 


with  m large,  to  the  gamma  functions  obtained  by  putting  c * in  (6), 

Theorem  2.  Let  F(x)  be  the  cdf  of  a given  distribution  and  £ and  x be 
respectively  the  median  and  the  sample  median  of  a sample  of  size  ?n  * 1.  Suppose 
that  x(F),  the  inverse  function  of  F(x),  is  for  0 < F < 1 uniquely  defined  and  equal 

to  a convergent  series  of  powers  of  F - 1st 


(21) 


oo  , m 

x(F)  - £ - 2 a (F  - 4) 

_ m c 


m=*l 


W»ito 


{22) 


m , r oo  , r 

2 ajF  - |)  , and  Rm  - 2 a„(F  - |)  . 

r=m+l 


m , r 
r=l 


If  there  oxists  a sequence  (bm)  such  that 


(23) 

00 

2 

2 

(a  /b  ) <oo 
nv  m 

m=l 

00 

(2U) 

2 

b^(F.|)  < 

for  0 < F < 1.  and 


9 


(?cn 


oo  0 ( 2m 

£ d j (F  - ■=)  C F"(l  - F)  dF  < oo  , 
n m I d n 


>,i=i 


i ' 


for  seme  positive-  integer  value  of  n,  and  u0,  the  variance  of  x,  is  finite  for 


n =■  n^,  then  for  all  integers  n > n. 


72' 


0> 


(?6)  (I2  = Lim 

• m — **  oo 


{ I S2  C F^l  - F)n  dF  - ( 
1 m n ' v 


S C 1^(1 
m n 


F)n  dF)?  ) 


Further,  if  f(x)  is  symmetric  with  respect  to  then  the  second  term  in  the  bracket 
should  be  omitted. 


Proof.  For  simplicity  we  assume  that  f(x)  is  symmetric  with  respect  to  x = 


In  this  case  jl^  = % 


and 


oo 


a b 


(27)  Up  = 1 (x  - K)  1 - F(x)_7nf(x)dx  = 


J 


-00 


! + 

1 

-00 


oo 


a b 


" *1  + h + *3  > 


where  a < l and  b > 5 will  be  chosen  later.  It  can  be  shown  that 


(28)  I,  + I3  = 0(nl/?rn)  , 

where  r is  defined  by  (17).  Choose  a,  b,  and  c such  that  0 < -i  - F(a)  = F(b)  - i 
=>  c < Usinp  (21)  and  (?2),  we  have 


10 


i 


I 

! i | 


(29)  |I, 


where 


(30) 


(31) 


(3?) 


(33) 


/ 

0 


S2c  Fn(l 
T.i  n 


- F)ndFj  < J + + J,  + J,.  , 


J,  = 

_L 


S2C  F^l  - F)n  dF  , 
m n v ’ 


o + c 


1 

2 ” c 

! j 

"nTn*  ^ 


F)n  dF  , 


2 + C 


R^n^'U  - F)n  dF  , 


2 “ C 


2 + c 


J,  = 


2 |S  R |C  F“(l  - F)n  dF 
mm  n 


1 

2 ‘ c 


By  Schwarz's  inequality,  we  get 


(310 


oo  a 2 oo 


2m 


n a c.  0 -i  c.m  -»  „ -t 

J *J  <6(i-c)Z  (#-)  • U | (F-|)  V^-d  - F)-1 

m=l  m m=l  ! 


dF 


i 

! 


, « 


By  (23)  and  (23),  the  two  series  on  the  RHS  are  convergent  for  n > n^.  Hence  if 
n > nQ,  J1  + J?  tends  to  0 as  c tends  to  \ . 

Further,  from  (32), 


a 


11 


(35) 


oo  o 2 oo 
J3  f 2 (r^)  * 2 

r=m+l  r r=m+l 


b2(F  - C 1^(1  - F)n  dF 


(36) 


oo  a 2 

JU  2 2A  2.  <^> 

r-x  r 


oo 

Z 


r=m+l  r 


£>7  ? 


on  . i 
2 br  j <F 


r=*l 


\)2r  CnP"(l 


- F)rl  dF  . 


As  m tends  to  infinity,  tends  to  0.  Consequently,  for  any  fixed  n > n^. 


(37) 


( 

IL  - Dim  j S^C  F^l  - F)n  dF  . 
2 rc— *oo  i mn 
0 


An  immediate  consequence  of  Lemma  2 and  Theorem  2 is 


Theorem  3.  If,  in  the  preceding  theorem,  a^  = 0(2%^)  for  some  integer  k, 
then  (?6)  holds  for  every  n > 2k  + 3. 

Proof.  Choose  b = 2mm^+\ 

m 

Thus  we  have  found  an  approximation  for 


(38) 


1 

( 

I S2C  Fn(l  - Fln  dF  , 
I m n * 

0 


for  all  integers  n which  are  not  too  small.  The  integral  on  the  RHS  of  (38)  can  be 
evaluated  by  formulas  given  in  Lemma  1.  An  upper  bound  for  the  error  committed  in 
such  approximation  is  given  by  the  sum  of  the  RHS  of  (35)  and  (36),  Finally  we 
note  that  the  same  method  can  be  used  to  obtain  the  moments  of  x in  general. 

Example  1.  Laplace  Distribution.  Let  f(x)  = ~ e=^X’,  then  F(x) 

= 1-  i eX  if  x > 0,  and  F(x)  * ~ eX  if  x < 0.  Hence 


1? 


1 


(39) 


L = ? I x2C  F-(l  - F)n  dF 
. £ n 


1 

2 


1 00  1 1 1 
If  ? f F < 1,  thon  x = - log  2(1  - F)  = 2 nf1/" 2(F  - 7".  So  a - Ai  . It 

c T — 2 — IT. 


m=l 


follows  that  for  n > 1, 


(Uo) 


m 


jl2  = Lim  ? | (2  r-1/"2(F 


m— > oo 


r=l 


i)  7r  ) C F^l  - F)n  dF 
c.  — n 


1 

2 


Oil) 


oo 


2 w 
ra=»l 


m 


, m+1 

I 2(F  - -i)  j 0^(1  - F)n  dF  , 


whore  /~1,  p.  8U_7> 


(U2) 


m 


-1 


w = 2 r (m  - r + l)’ 


ia 


r-i 


-1  m -1 
2(m  +1)  2 r 

r=»l 


If  we  use 


(U3) 


2k-l 


ra+1 


t-rw  — J.  1 -y  IIF1 

- 2 wm  J?(F  - ^)|  CnFn(l  - F)n  dF  , 


ripl 


then  the  error  committed  is  bounded  by 


(U3a) 


2n"1/?w2k(1  * 1H)(n  + k)’1/2 


( 2n'  1 X 2n  O ) .. . ( ?n+2k-l7  * 


In  deriving  (U3a),  wc  used  the  facts  that  is  a monotonically  decreasing  sequence 


I ' 


.IT 

l 


i ' 


l 8 


I 


r i 

i ■ i 


! i 


! 


! I 
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of  m £~1,  p.  8f?_7  and  tho  Wallis  product  jTl,  ?.  383  7 


1 • 3 •••  ( 2n-l)  1/2 

? • U • • • C?nT  n 


-1/2 
n ' 


is  a monotonically  increasing  sequence  of  n.  Similarly  if 


(hi,) 


?k 

I w j | ?(F 

m=l  n J 

0 


, m+1 

\)  I 0^(1  - F)n  dF  , 


then  the  error  is  bounded  by 


(UUa) 


2k  p ♦ A.)  3» » «( 2k+l) 

2k+l',~  2n'  ( 2n+l ) ( 2n+3  J7. . ( 2n+  2k 


Example  2.  Cauchy  distribution.  Lot  f(x)  =»  l/n(l  + x^),  then 


F(x)  = n”^/” tan  ^x  + for  -oo  < x < oo,  so  x(F)  = tan  n(F  - -i)  for  0 < F < 1. 


It  can  be  shom  that  the  variance  of  the  sample  median  of  a sample  of  size  2n  + 1 > 5 
is  finite: 


(U3) 


il2  - ! tan2  n(F  - i)  0^(1  - F)n  dF  . 


/ 

0 


n 


It  is  known  £~7t  pp.  20U,  237_7  that 


(U6) 


oo 


tan  x = 2 (-1) 

m=l 


m-1  2?m(22ra  - 1) 


nsrr~  b^'1  - -r°- ' * i < 5 • 


where 


4-  ; 


Ill 


3„  = 2(-l)' 


l-l  (am)! 


V —Cull 

l r 


(?0'  r=l 


Wg  S9G  th £l"t’  3 = 0{  2^)  > hence  by  Theorem  3,  (3?)  holds  if  n > 3. 


5.  Normal  distribution.  Throughout  this  section,  f(x)  = (Pn)  ' e 


-1/2  -x7? 


F(x)  = j f(t)  dt,  and  ::(F)  is  the  inverse  function  of  F(x).  No  simple  general 
-oo 


form  of  the  derivatives  of  x(  F)  at  F ■ - is  known.  But  the  first  few  derivatives 
of  x(F)  can  be  obtained  by  direct  differentiation,  e.g.. 


dx  1 d x 1 d /_1  \ 

dF  = fT5E7  ' TT?  7Tx7  SxC7CxV  > 


For  finite  m and  0 < F < 1,  let 


(U7)  x(F)  = a1(F  - vj)  + a?(F  - ij)?  + ...  + am(F  - -|)m  + Rm+1(F  “ ^)m  1 > 


Zip  | 


= (2ti)^^?/3J  , - 7{2r>)^^  /$'.  , ..., 


I ^—2/n 

Ru  - -‘n1’"  /"(7  + U6x£  + 2^)0^  7*  , ...  , 

J P • - Q 


where  /”g(x)_7p  g/”x(Fe)_7,  F0  = i + 2111(3  0 - e 


A, 

vt 
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A.  A Lower  Bound.  Take  the  integral  (39,',  let  the  range  of  integration  be  divided 

111  1 

into  two:  — to  — + c,  and  “ + c to  1,  where  0 < c < -r  . F we  neglect  the  last 

c C £ C 

integral;  in  the  first  integral,  replace  x(F)  by  its  expansion  ( U 7 with  m = 6,  and 
then  neglect  all  terms  containing  the  remainder  term  FL  (which  is  non-negative),  and 


finally  let  c approach  and  use  Lemma  1,  we  are  able  to  obtain  a lower  bound  ior 

<v 

the  variance  of  x of  a normal  parent  distribution  with  unit  variance,  i.e.. 


(h9) 


~ , i—-  n 13*1^  y 

^2  - XP  L ■■  * 2\ 2:i+?)  ’ 2i*i2n+5i(2n+7)  * 


where 

(50)  X2  = h/2(  ?n  4-  3)  . 

Incidentally,  for  n > 3,  we  may  expand  the  KHS  of  (1*9)  into  a power  series  in 
(2n  + l)-~  and  ot tain  an  approximation  for 

(51)  £2  - \L2  f1  - (2  - |)(2n  + l)"1  - (3n  - h - 13n2/2U)(?n  + l)"2  + ...  J, 

where  is  given  by  (2).  In  terms  of  standard  deviations,  and  with  the  numerical 
values  of  the  coefficients  computed,  (£l)  is  equivalent  to 

(5?)  p?/?  ~ f1  ' (.?lU6)(2n  + l)"1  - ( ,0806)( 2n  + l)"2  <•  . . . _7  . 

This  agrees  with  a formula  obtained  by  K.  Pearson  for  the  same  purpose  /~8,  D.363  ~J . 
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E.  Approximations  for  large  s amplcs. 


oo 


oo 


(53)  il0  - 2 j x2C  /”F(x)  7 nr  1 - F(x)  7n.f(x)dx  = 2 j + 2 j - L, 

■ I n-  — — ~ I ! - 


) 


i2>  say. 


Since  F(x)7”"l  - F(x)_7  < F(a)/“l  - F(a)_7  if  x > a > 0 and  Cn  < (2n)  1 

+ (Pn)"1  7(2n  + l)1/222n+1,  it  follows  that 


/ 

(5U)  I^A2  5 ( 2/ti)3/2(1  + 3/?n)(2n  + D^/Vta)  (l-F(a))_7n2  j x?(2n) 


J 


1 _ x2 
2e  dx. 


^3 


In  1^,  uso  F as  the  independent  variable,  and  replace  x(F)  by  a1(F  - ^)  + a^(F  - 2) 


+ M-(F  - ~)5  where  M-  = Max  L=  (2nP/<:k/$i  and  A = (7+U6a^+2Ua4)e:>u  f*t  then 
5 5 0<X<a  5 


^5/2, 


Ao>.  A 


T a ” . f^\2,2k  . 1 ^ 3 • 5 

J1A2  - 1 + 2U^1  + 2 (TT  + ~2} 


. /«\3  _2A  3 * 5 • 7 

''2;  31^7  ^ 2n+5 ) ( 2n+7 ) ^ 2n+9 ) 


, /Ti'sU  / A ^2  3 • 5 • 7 • 9 

+ ¥ (TT  (Sa+5)  .'.."CM+H7  • 


Combining  (U9),  (53),  (5U),  and  (55),  we  conclude: 


t 


(56) 


V-r 


First  Approximation:  - gCgfiT  'jj 


Second  Approximation: 


X2^""1  + 2(  2n  + 5)  J ‘ 


If  the  second  approximation  is  used,  an  upper  bound  for  the  proportional  error  (de- 
fined to  bo  j (True,  value  / Approximation)!-  1 ) is  given  by  the  sum  of  the  RHS 


Hi 


YT~f' 


of  (5U)  and  the  Inst  three  terras  of  that  of  (55).  If  the  first  approximation  is 


used,  then  there  is  an  additional  error  n/2(2n+5),  the  second  term  in  the  bracket 
of  the  second  approximation. 

The  following  table  is  given  for  illustration.  We  choose  successively  for 
a:  .35,  .50,  .65,  .75.  It  is  to  be  noted  that  the  RHS  ox'  (5U)  is  a decreasing 
function  of  n fer,  o.g.-,  n > 25,  a = .75.  The  RKS  of  (55)  is  obviously  also  a de- 
creasing function  of  n.  Therefore  what  Table  1 means  is  that!  o.g.,  for  sample 
sizes  > 5l  (not  just  - 51),  if  the  first  approximation  is  used,  then  the  proportional 
error  is  < .092,  or  explicitly:  1 < < 1-092. 

TABLE  1 

Proportional  Error 

Sample.  Size  First  Approximation  Second  Approximation 


501 

3.  2 x 

10"3 

6.8  x 

10’? 

201 

8. 5 x 

10"3 

7.8  X 

lo’k 

101 

2.2  x 

10’ 2 

6.9  x 

1C’3 

51 

9.2  x 

io’2 

6.3  X 

10"  2 

*»  n 


lO 


6 . Normal  cl i S t*r ibu t Ion  --  a uilTerunt  approach . 
is  used  to  derive  upper  and  lower  bounds  for  the 
distribution  with  unit  variance.  We  state 


.ui  this  section  a different  me  thou 
variance  of  x.  of  a normal  parent 


Theorem  U.  Let  x and  be  respectively  the  sample  median  and  its  variance 
of  a sample  of  size  2n  + 1 drawn  from  a normal  distribution  with  unit  variance.  If 

* , At 

= n/2(2n  + 1)  is  the  variance  of  the  asymptotic  distribution  of  x,  then 


(57) 


B (1 
n 


?n  + 2 


3/?  _ 

) < P2/  - 


D (1 
n 


3/2 


T . On-t-l  t /o  i /o 

where  Bn  = C^|rn  1(2n;  ' V(?n  + l)x/ ", 
all  practical  purpose  and  n > U, 


and  C - ( 2n  + l).’/(n! 

n 


nJ). 


Further,  for 


(58) 


1 + 


7n+3 

2Un2(2n+l) 


< B < 1 + 
n 


Bn  + ' 


or 


(5?) 


B - 1 + ip  . 

n on 


Proof,  By  using  the  following  transformations  consecutively. 


(60) 


u = F(y)  , 


v = u 


1 

2 


where  F(y) 


, j p 

j (2n)”^2  e”x  ^ dx  , we  obtain 

I 

-oo 


ho  = 


2Cn(l> 


2n 


A/2 


y2d 


O n 

Ijv;  dv  . 


(61) 


Let 


JU 


(6?) 


1/2 


■ill-  e-t  /(2n*l))  ' ( 


:nen 


oo 


(63) 


’ 2Bn 


(Jn)-V?  y2  c-(n+l)tB/(2n+l)  hi(t/(2n,1)l/2)  dt  , 


where  h^(t)  - t(l  - c ) >1  for  all  t > 0.  Further,  it  is  known  /~\0j  that 


if  I \ 

V04/ 


(2>0'1/2  e-x?/2  dx  < ± (1  - 0-2//")1/S 


/ 

0 


o O 

Using  (6U),  it  can  be  shown  that  y > (iyt  . Therefore  it  follows  from  (63)  that 


(65) 


1 5/2  _ 

^2  - Bn^1  " 2n  + 2*  ^2 


On  the  other  hand,  we  have  from  (63), 


00 


(66) 


Uo  = 


II  C 


{2n)-1/>2t2  G"“wWi) 


• t(?/n)y?h2(t/(2n+l)1/2)dt)  , 

_t2  2 1/2 

where  h^(t)  = e"  /t(l  - e-T;  ) . If  we  can  show  that  (?/n)y2h0(t/(2n+l)1''2)  < 1 


for  all  t > 0,  then 
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Li 


1 3/2  _ • 

..  < r ft  + .i-1)  ,, 

**2  - n 2n  ^2  * 


Now  y<:h2(t/(?n  + l)1^2)  < g0(y)  where 


2\  a ? 


G0(y)  = y (1  - Uv^)/ijv 


It  can  be  seen  that  Lim  gn(y)  = n/2.  Hence  it  suffices  for  our  purpose  to  show 

y— > 0 


that  g-jCy)  is  decreasing.  Let  " 1 " denote  differentiation  with  respect  to  y.  Then, 


gn(y)  = (y/^v3)  g]L(y)  , 


, V . , ?.  I 

g^y^  « v(l  - uv  ) - y v . 


I I p n 

g^y)  * g2(y)  v * whGre  g2(y)  = y"  - i2v  i 


g?(y)  = (12/n}e  - g^(y)  , where 


g^(y)  = (n/6)y  ey  - e 


y2 . //*  f 0-*2/2  ^ 


It  is  known  f~  10  that 


ey  /2  I c~x  /2  dx  - 2 y2nJ'1/l*3...(2n+l) 


^4- 


. It  can  be  shown,  by  a similar 


oo 


Hence  g (y)  = 2 < 3JF  “ 

J n=0  n*  x J 


argument  used  in  /”i0  ~J  i'or  a 


1 » ?n+I 

• • • ( 2n+l)  y 


similar  purpose,  that 


(73) 


g3(y)  = y3  fa0y~Z 


ai + 


where  < 0 and  a..  > 0,  i = 1,  2,  ...  . Hence  there  exists  a y^  > 0 such  that 

g^(y)  < 0 if  0 < y < y0  and  g^(y)  > 0 if  y > yQ.  So  as  y increases  from  0 to  oo, 

g2(y)  decreases  steadily  from  0 to  a minimum  and  then  increases  steadily  to  oo. 

Consequently  g^y)  first  decreases  steadily  and  then  increases  steadily.  As 

Lim  g, (y)  * Lim  gn(y)  = 0,  it  becomes  clear  that  g, (y)  < 0 for  all  y > 0. 
y — > c y — > oo 

Therefore  gn(y)  is  a decreasing  function  of  y.  This  completes  the  proof. 


Finally  we  note  that  ($8)  is  obtained  by  using  nj 


-(Sic)inn+l/2o'n+(12n) 


-1 


Remark  1. 
by  (5?)  if  wc  use 


The  lower  bound  for  \i?  given  by  (h9)  is  better  than  the  one  given 
($9)  for  B . This  i?  so  oven  if  the  last  term  at  the  RKS  of  (1*9) 


is  omitted.  For 


/7I,\  n , n2  n_  r-  (8-2n)n  * 20  - « 7 

^ ?(2n+3)  U(2r+3K2n+£)  2T2v*TJ  - 1 2(2n+3K?n+£)  - 
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Now  if  n > 2,  the  last  term  in  the  bracket  of  (7*0  is  smaller  in  absolute  value 

1 i 1 

than  (2n+2)  and  (1  + £~1  - l/(2nt2)_7J  < 1.  Therefore  the  quantity  in  the 

bracket  of  (7*0  is  greater  than 

3 

1 " 2n  + 2 - ^ + ” 2n  + 2^  ” * 

Tor  n = 1,  direct  comparison  shows  also  that  the  LHS  of  (7*0  is  greater  thar.  that 
of  (57). 

Remark  2.  Since  both  lower  bounds  for  u^,  (*4-9)  and  (57)  are  smaller  than 

Ug,  while  the  upper  bound  is  greater  than  p^  (1  + l/2n),  we  cannot  be  6ure,  just 

by  comparing  these  bounds,  that  in  using  p^  as  an  approximation  to  p^,  the  propor - 

tional  error  is  less  then  l/2n.  Therefore  the  second  approximation  given  by  (56) 
is  better  tnan  u 2,  for  large  samples.  (Table  1). 

7^  distribution.  We  shall  now  employ  the  some  technique,  used  in  \ 6, 

to  derive  upper  and  lower  bounds  for  the  variance  p^  of  the  sample  median  x of  a 

sample  of  size  2n  + 1 drawn  from  a Laplace  distribution  with  pdf 
(75)  f(x)  = \ e 


Clearly,  the  variance  in  this  case  of  the  asymptotic  distribution  of  x is 


(76) 


^2  = 2n  + 1 


Tl- 

W c ouuwc 

A#  ,«• 

Theorem  5*  If  pp  and  p^  are  as  defined  above,  then 


— 
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(77) 


B (1  - — — )2 
n'  2n+2; 


£p/P2  ^ i-Sl  * fe>2 


where  is  given  by  (57)  and  (59)- 

Proof.  It  can  be  seen  easily  that  p2  is  equal  to  the  RHS  of  (63)  if  v and 
t satisfy  (62)  and 


(78) 


v = 


1 -X  , 

7,  e dx. 


We  proved  4_7  that  for  all  y > 0, 


{ 7Q^ 


v < | (1  - e’yV  . 


i I 

I ! 


From  (62)  and  (79).  we  have-  y > jl  Hsr.cc  it  follows  from  (63)  that  M2/u2  has 

a lower  bound  given  by  ( 77 ) • 

Further,  from  (63) 

00 


(80)  n2  = 2BQh2 


(2n)"^  t2  e"nt  //(2n+l)  f y2h2(t/(2n+l)^)  dt  }, 


t 


where  h2(t‘)  is  given  by  (66) 


(81) 


We  say  that 


rj  i / 3' 

y_hg ( t / (2rr*-l)2)  < 1,51 


If  this  is  true,  then  the  RHS  of  (77)  is  an  upper  bound  of  d2/u2.  Thus  the 
proof  is  completed. 


* 


V 


To  establish  (oi),  we  introduce,  as  in  (68), 


gQ(y)  = y20  - Uv2)/Uv2 


where  y and  v satisfy  (73).  For  all  y > 0,  gQ(y)  is  not  smaller  than  the  LE3  of 
(8i)  and 


Lim  gQ(y)  = 


1 0 
as  y > 


Let  denote  differentiation  with  respect  to  y,  then 


g0(y)  » gpiy)  > where 

2v 


gx(y)  = V - 4v5  - | y e"y 


gx(y)  = I e“y  s2(y)  > 


where 


g.(y)  = - 12V*  + y 


1. 


s2(y) 


= - 12  V e“y  + 1 = g,(y) 


12  e-y  (i  - e-y)  . 


If  f(x)  is  a function  of  x,  and  ir  as  x increases  from  0 to  oo,  f(x)  varies  from, 


e.g.,  positive  to  negative,  and  then  back  to  positive,  we  will  write,  for 
simplicity.  As  x:  0 > oo,  f(x):  + . Now 


(90) 


t > > 

g^(y)  ^ 0 according  as  y ^ log  2, 


and  g^(0)  = g^{oo)  = 1,  and  g(log  2)  = - So  as  y:  0 > oo,  g^(y): 

H°w  62(0)  = °>  while  gg(oo)  = oo.  We  say  that  as  y:  0 > oo,  gg(y): 

Otherwise  gg(y)  > 0 for  all  y > 0,  so  g1(y)  > 0 and  g1(y)  > 0 for  all  y > 0,  as 

g^(0)  = 0.  Eence  gg(y)  is  steadily  increasing.  Phis,  however,  contradicts  (83). 

It  follows  that  as  y:  0 — > oo,  g^(y):  + ,~,+  . Now  g^O)  « g.^00)  = 0,  hence  as 

y:  0 > oo,  81(y):  + ,-.  Therefore  we  conclude  that  as  y:  0 > oo,  g^y) 

increases  steadily  from  1 to  a maximum,  and  then  decreases  steadily  to  0.  To 
find  the  maximum  of  gQ(y),  we  first  solve  g^(y)  = 0,  which  is  equivalent  to 

2v(i+2v)  - y = 0.  Using  table  /~12_7.-  we  obtain  an  approximate  solution  y = i.ip. 
The  maximum  of  gQ(y)  is  then  found  to  be  1.31. 

Remark.  The  variance  of  the  sample  mean  (of  a sample  of  size  2u+l)  drawn 
from  a Laplace  distribution  with  pdf  given  by  (75)  ia  2/(2n+l).  It  follows,  from. 
Theorem  5,  that  the  sample  median  has  smaller  variance  than  the  sample  mean  for 
sample  size  2n+l  >7.  In  a recent  paper,  A.  E.  Sarhan  found  that  for 

sample  sizes  equal  to  2,  3>  and  5,  the  variance  of  sample  median  is  also 

smaller  than  that  of  the  sample  mean. 
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